Late Latin Charter Treebank: contents and annotation


Abstract

This paper describes the construction and annotation of the Late Latin Charter Treebank, a set of three dependency treebanks (LLCT1, LLCT2 and LLCT3) which together contain 1,261 Early Medieval Latin documentary texts (i.e., original charters) written in Italy between AD 714 and 1000 (about 594,000 tokens). The paper focusses on matters which a linguistically or philologically inclined user of LLCT needs to know: the criteria on which the charters were selected, the special characteristics of the annotation types utilised, and the geographical and chronological distribution of the data. In addition to normal queries on forms, lemmas, morphology and syntax, complex philological research settings are enabled by the textual annotation layer of LLCT, which indicates abbreviated and damaged words, as well as the formulaic and non-formulaic passages of each charter.


Similar resources

Parallel Entity And Treebank Annotation

We describe a parallel annotation approach for PubMed abstracts. It includes both entity/relation annotation and a treebank containing syntactic structure, with a goal of mapping entities to constituents in the treebank. Crucial to this approach is a modification of the Penn Treebank guidelines and the characterization of entities as relation components, which allows the integration of the enti...


ITU Treebank Annotation Tool

In this paper, we present a treebank annotation tool developed for processing Turkish sentences. The tool consists of three different annotation stages: morphological analysis, morphological disambiguation and syntax analysis. Each of these stages is integrated with existing analyzers in order to guide human annotators. Our semiautomatic treebank annotation tool is currently used both for crea...


Automation of Treebank Annotation

Thorsten Brants and Wojciech Skut, Universität des Saarlandes, Computational Linguistics, D-66041 Saarbrücken, Germany. Abstract: This paper describes applications of stochastic and symbolic NLP methods to treebank annotation. In particular we focus on (1) the automation of treebank annotation, (2) the comparison of conflicting annotations for the same sentence and (3...


The Annotation Guidelines of the Latin Dependency Treebank and Index Thomisticus Treebank: the Treatment of some specific Syntactic Constructions in Latin

The paper describes the treatment of some specific syntactic constructions in two treebanks of Latin according to a common set of annotation guidelines. Both projects work within the theoretical framework of Dependency Grammar, which has been demonstrated to be an especially appropriate framework for the representation of languages with a moderately free word order, where the linear order of co...


Porting an Ancient Greek and Latin Treebank

We have recently converted a dependency treebank, consisting of ancient Greek and Latin texts, from one annotation scheme to another that was independently designed. This paper makes two observations about this conversion process. First, we show that, despite significant surface differences between the two treebanks, a number of straightforward transformation rules yield a substantial level of ...



Journal

Journal title: Corpora

Year: 2021

ISSN: 1755-1676, 1749-5032

DOI: https://doi.org/10.3366/cor.2021.0217